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It has been hypothesized that in the era just before the last universal common ancestor emerged, 
life on earth was fundamentally collective. Ancient life forms shared their genetic material freely 
through massive horizontal gene transfer (HGT). At a certain point, however, life made a transition 
to the modern era of individuality and vertical descent. Here we present a minimal model for this 
hypothesized “Darwinian transition.” The model suggests that HGT-dominated dynamics may have 
been intermittently interrupted by select ion-driven processes during which genotypes became fitter 
and decreased their inclination toward HGT. Stochastic switching in the population dynamics with 
three-point (hypernetwork) interactions may have destabilized the HGT-dominated collective state 
and led to the emergence of vertical descent and the first well-defined species in early evolution. A 
nonlinear analysis of a stochastic model dynamics covering key features of evolutionary processes 
(such as selection, mutation, drift and HGT) supports this view. Our findings thus suggest a viable 
route from early collective evolution to the start of individuality and vertical Darwinian evolution. 
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I. INTRODUCTION 
A. The last universal common ancestor 

In the final chapter of “On the Origin of Species”, 
Charles Darwin speculated that all life on earth may have 
descended from a common ancestor. As he observed, 
“all living things have much in common, in their chem¬ 
ical composition, their germinal vesicles, their cellular 
structure, and their laws of growth and reproduction. 

... Therefore I should infer from analogy that proba¬ 
bly all the organic beings which have ever lived on this 
earth have descended from some one primordial form, 
into which life was first breathed” [Ij. 

A century after Darwin, molecular biology provided 
new lines of circumstantial evidence for a universal com¬ 
mon ancestor. All organisms were found to use the same 
molecule (DNA) for their genetic material, as well as a 
canonical look-up table (the genetic code) for translat¬ 
ing nucleotide sequences into amino-acid sequences 
Further clues came from cross-species comparisons of the 
molecules involved in the most fundamental processes of 
life, such as protein synthesis, core metabolism, and the 
storage and handling of the genetic material. The first 
such analysis [^, based on snippets of ribosomal RNA, 
provoked a revolution in our understanding of life’s fam¬ 
ily tree @-[§1. It indicated that life is divided into three 
different domains: the Archaea, the Bacteria and the 
Eucarya 0, i S- Later studies using other molecular 
sequences placed the root of the tree, corresponding to 
the last universal common ancestor, somewhere between 
the Bacteria and Archaea roughly 3.5 — 3.8 bil¬ 

lion years ago. The nature of the last universal common 


ancestor, however, remains unresolved: Was it prokary¬ 
otic or eukaryotic? Did it thrive in extreme or moderate 
temperatures? Was its genome based on RNA or DNA? 
For a review, see Ref. [1^ and references therein. 


B. The era of collective evolution 

Our work in this paper was inspired by a conjecture 
proposed by Woese and his colleagues |l5l4l8j . Accord¬ 
ing to this conjecture, the last universal common ances¬ 
tor was a community, not a single creature. It marked a 
turning point in the history of life: before it, evolution 
was collective and dominated by horizontal gene trans¬ 
fer; after it, evolution was Darwinian and dominated by 
vertical gene transfer. 

In Woese’s scenario, life in the epoch leading up to 
the universal ancestor was intensely communal. It was 
organized into loose-knit consortia of protocells far sim¬ 
pler than the bacteria or archaea we know today. Woese 
and Fox 0 called those hypothetical ancient life forms 
“progenotes.” The term signifies that the coupling be¬ 
tween genotype and phenotype had not yet fully evolved, 
mainly because the process for translating genes into pro¬ 
teins had not yet fully evolved either. A rudimentary 
form of translation existed, but it was ambiguous and 
hence had a statistical character. Instead of producing 
a single protein, early translation produced a cloud of 
similar proteins. This ambiguity in protein synthesis in 
turn limited the specificity of all the progenote’s interac¬ 
tions. For example, lacking the large, complex proteins 
necessary for accurate copying and repair of the genetic 
material, the progenote’s genome was tiny and subject 
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to high mutation rates. 

Progenotes were not well-defined organisms as such, 
because they had no individuality and no long-term ge¬ 
netic pedigree. Their genes and component parts could 
come and go, being swapped in or out with other mem¬ 
bers of the community via horizontal transfer. But be¬ 
cause biochemical innovations produced by any member 
of the community were available to all, evolution at this 
time was rapid—probably more rapid than at any time 
since. Selection acted on whole communities, not on in¬ 
dividual progenotes. Those communities that were bet¬ 
ter at sharing their biochemical breakthroughs flourished. 
Out of this cauldron of evolutionary innovation, the uni¬ 
versal genetic code and its translational machinery co¬ 
evolved, in response to the selective pressures favoring 
efficient sharing and interoperability. 

Vetsigian, Woese, and Goldenfeld 0 confronted and 
constrained these speculations with mathematical anal¬ 
ysis and computer simulations. Going beyond Woese’s 
conjectures, they probed early evolution scientifically by 
interpreting the available data on the genetic code. Their 
dynamical model for the evolution of the genetic code (l9| 
showed that a collective state of life is required to obtain 
the observed 0E1 statistical properties of the code, in 
particular, its simultaneous universality and optimality. 
A later study by Goldenfeld and colleagues provided 
further evidence that only a collective state of life could 
have created the highly optimized code used by all life 
today (^ . 


C. The Darwinian transition 

How did the era of collective evolution come to an end? 
Woese speculated that as the translation process began 
to improve, and as progenotic subsystems became in¬ 
creasingly complex and specialized, it would have become 
harder to find foreign parts compatible with them. Thus, 
horizontal gene transfer would have become increasingly 
difficult. The only possible modifications at this point 
would have come from within the progenote’s lineage it¬ 
self, through mutation and gene duplication. It was in 
this way that the progeno tes would have made the Dar¬ 
winian transition [l7|, [iSj to become “genotes,” i.e., life 
forms with a tight coupling between their nucleic acid 
genotypes and their protein phenotypes, and that could 
therefore evolve through the familiar Darwinian process 
of vertical descent. 

The model considered below is an attempt to explore, 
in mathematical terms, how the Darwinian transition 
from the collective state to the modern era of individ¬ 
uality might have taken place. Our approach shares with 
Ref. [3| the outlook that a dynamical systems calcula¬ 
tion should be devised to support or refute the hypothe¬ 
ses considered. Our results lend support to the proposed 
collective state of life (l5l - [l9| by providing a potential 


mechanism for the exit from that state. 


D. Horizontal gene transfer 


Over the last decades more and more evidence has 
accumulated that, besides selection, mutation, and 
drift [23|, another process drives evolution: horizontal 
gene transfer. Here we briefly review the main points 
about horizontal gene transfer relevant to the mathemat¬ 
ical model developed below. 

While reproduction implies a vertical transfer of genes 
from one entity to the next in the phylogenetic tree, there 
are also processes in which possibly unrelated individuals 
exchange genetic material during their lifetimes, i.e., hor¬ 
izontally in the sense of the tree. This transfer of genes 
within one generation is consequently termed horizontal 
gene transfer (HGT) [H-i^ or lateral gene transfer. It 
is now widely accepted that HGT is a fundamental driv¬ 
ing force of evolution [25, 2^3^, and that its existence 
raises profound theoretical problems for evolutionary bi¬ 
ology. For example, the l ong standing problem of defin¬ 
ing bacterial species (sol . [3#l38| is due, in part, to the 
promiscuous use of HGT by bacteria. A recent primer on 
horizontal gene transfer and its pot ential for evolutionary 
processes in general is given in |39| . 

As discussed above, if HGT was rampant in the early 
stages of evolution, the last universal common ancestor 
was a community, not a single organism 0113, EES- 
In this collective state, individuals could not yet be dis¬ 
tinguished, as each progenote’s genes were frequently ex¬ 
changed through HGT. In terms of the model to be devel¬ 
oped below, the total pool of genotypes in the collective 
state would be spread out and thus broadly distributed 
in the state space of all theoretically possible genotypes. 
Gonversely, a genotype distribution that is highly local¬ 
ized in state space, being concentrated on just one or a 
few genotypes, would be the model’s version of a well- 
defined species. 

Woese postulated that as the collective state of the 
progenote population slowly evolved toward higher com¬ 
plexity, its rate of HGT slowly decreased 0. At some 
point the system crossed the “Darwinian threshold” 0. 
Then natural selection instead of HGT started to dom¬ 
inate the dynamics. The fitter individuals were selected 
for and the first species emerged from the distributed 
state. In the colorful language of Dyson |4l| : 


But then, one evil day, a cell resembling a 
primitive bacterium happened to find itself 
one jump ahead of its neighbors in efficiency. 
That cell, anticipating Bill Gates by three bil¬ 
lion years, separated itself from the commu¬ 
nity and refused to share. Its offspring be¬ 
came the first species of bacteria—and the 
first species of any kind—reserving their in- 
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tellectual property for their own private use. 
With their superior efficiency, the bacteria 
continued to prosper and to evolve separately, 
while the rest of the community continued its 
communal life. Some millions of years later, 
another cell separated itself from the commu¬ 
nity and became the ancestor of the archaea. 
Some time after that, a third cell separated 
itself and became the ancestor of the eukary¬ 
otes. 


After making a Darwinian transition, evolution pro¬ 
ceeds in the familiar vertical manner, being driven by se¬ 
lection, mutation, and drift (j^ . with HGT playing only 
a minor role. Such Darwinian dynamics have, of course, 
been studied extensively in both experimental and model 
settings (2^ . l43 - i46| . Compared to the dynamics of HGT 
their properties are relatively well understood. Recently, 
potential influences of HGT on such evolutionary dynam¬ 
ics have been investigated 16, Some mathe¬ 

matical models of HGT have focused on how it can in¬ 
crease a population’s fitness in Darwinian evolution [M |. 

Keep in mind, however, that the hypothesized HGT as¬ 
sociated with progenotes and the Darwinian transition, 
being associated with ribosomal genes and the rest of 
the core machinery of the cell, would have been of far 
greater evolutionary significance than the HGT of, say, 
antibiotic resistance genes seen in bacterial communities 
today. In Woese’s scenario, the ancient form of HGT was 
rampant, pervasive, and tremendously disruptive and in¬ 
novative. It was the prime mover in shaping the fabric 
of the cell fl8| . 

We would like to understand what such a Darwinian 
transition would look like, mathematically. The model 
described in the next section is deliberately minimal. It 
leaves out all the biology of ribosomes, proteins, genetic 
codes, and the like. What remains is an attempt to cap¬ 
ture the essence of Woese’s speculations. In place of 
a community of progenotes, we consider a community 
of abstract genomes, represented by binary sequences. 
They interact via HGT, and are subject to mutation, 
selection, and drift on a fitness landscape. Our work 
suggests that HGT-dominated dynamics may have been 
intermittently interrupted by selection-driven processes 
during which genotypes became fitter and decreased their 
inclination toward HGT. Such stochastic switching in the 
nonlinear population dynamics may have destabilized the 
HGT-dominated state and thus led to a Darwinian tran¬ 
sition and the emergence of the first species in early evo¬ 
lution. 

On a side note, an interesting mathematical aspect of 
the model is that it necessarily involves three-point in¬ 
teractions, since HGT transforms one genotype into a 
second by importing pieces of a third. Thus the model 
provides a natural biological example of a complex hy¬ 
pernetwork. Until now, most models in evolutionary dy¬ 


namics and population biology did not need to go beyond 
ordinary network structure, with two-point interactions 
between nodes connected by links. 


II. EVOLUTIONARY MODEL 

To explore the consequences of HGT on evolution, we 
consider a model community of N progenotes evolving 
on a fitness landscape [23|, |53| in the presence of selec¬ 
tion, mutation, drift, and HGT. Each progenote carries 
a genome of length I composed of a sequence of the bases 
0 and 1. The genome of progenote i determines its fit¬ 
ness /j. The progenotes reproduce by the Moran pro¬ 
cess [23|,l5^, i.e., each progenote reproduces randomly in 
time, with its reproduction rate given by its fitness fi. 
Whenever a progenote of genotype i reproduces, an off¬ 
spring is added to the population which is either identical 
to genotype i or a mutant of genotype j with probabil¬ 
ity Instantaneously after such a reproduction event, 
one progenote in the population is chosen randomly to die 
and is hence removed from the population. We assume 
that one mutation event will only affect one of the bases 
of the genome, so that the Hamming distance between 
genotypes i and j is 1. 

Hence, our fitness landscape may be represented by a 
network where the different genotypes are the nodes of 
the network and the possible mutations form the links. 
Assigning two different bases, 0 and 1, and given the 
structure of the mutations, the resulting network is an 
^dimensional hypercube (Fig. [T|). 

The fitness landscape underlying our model is assumed 
to be a Mount Fuji landscape |23|: The highest fitness is 
assigned to one single genotype, the peak. Other geno¬ 
types are assigned lower fitness: the farther away from 
the peak in genotype space, the lower the fitness. Thus, 
a single-peaked mountain landscape is created on geno¬ 
type space, and a population eyolving purely through the 
processes of selection and mutation should converge to 
this peak. Note that the Moran process described above 
is a random process. It thereby constitutes a minimal 
model intrinsically including the effects of selection, mu¬ 
tation and genetic drift The latter is induced by the 
stochastic selection in the combined process of reproduc¬ 
tion, mutation and death and has the effect of randomly 
walking the population around in genotype space even if 
no fitness differences were present. 

To reveal the potential impact of HGT we incorpo¬ 
rate its basic features into the stochastic evolution model. 
Two progenotes A and B may meet and a subsequence 
s of progenote H’s genome may be inserted into A’s 
genome. As a result of this horizontal gene transfer event, 
the genotype of progenote A will transform into another 
genotype C, determined by its original genotype and the 
subsequence s. This process is illustrated in Figured] 

To model this process we add HGT-hyperlinks to the 
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Figure 1. HGT-hyperlinks introduce three-genotype 
interactions to the evolutionary dynamics, creating 
hypernetwork dynamics. This schematic example il¬ 
lustrates the insertion of an HGT-hyperlink (red solid and 
dcished) to the sequence space of genomes of length I = 4. 
Individuals of genotype A — 0100 take up the first two bases 
11 from genotype B = 1101 which are inserted at position 
three into A — OllTOO. After the last two bits are cut from 
the sequence, an progenote of genotype A becomes of new 
genotype C = 0111. 


hypercube network representing the fitness landscape. 
One such hyperlink symbolizes a three-genotype inter¬ 
action and is defined through the following process. We 
choose two genotypes A and B randomly as well as a 
random subsequence of genome B with length between 
X = 2 and x = I — 2 bases. This subsequence is inserted 
at a random position of genome A. The remaining x 
bases at the end of A’s sequence are cut off, keeping the 
sequence length of A constant. The new sequence deter¬ 
mines a genotype C, which genotype A becomes on inter¬ 
acting with B via this HGT-link, denoted {A, B, ( 3 ). If 
the resulting genotype C is identical to A, this HGT-link 
would not alter the population dynamics and would thus 
be irrelevant. We therefore neglect such self-projecting 
HGT-links. We repeat the above procedure until a pre¬ 
defined number m of new HGT-links has been added to 
the system. 

An HGT-link defines one type of HGT-event, in which 
part of genotype A is replaced by part of genotype B 
and is thereby transformed to genotype C. We consider 
these events to occur independently of each other. Let 
kx denote the number of progenotes of genotype X in 
the population. Then the HGT-events above occur at a 


rA^C = c-kA^- ( 1 ) 

Here the effective competence for HGT is modeled as 
a constant c > 0 that captures both the rate at which 
the progenotes meet and their actual preference for the 
initiation of an HGT event, given that they meet. 

Note that interactions of the form ([T]), independent of 
any model details, imply collective dynamics on a com¬ 
plex hypernetwork, due to their intrinsic three-genotype 
coupling involving A, B, and C. The dynamics of hor¬ 
izontal gene transfer in biological systems depends on 
a multitude of factors, including the mode (e.g., natu¬ 
ral transformation or conjugative transfer) of HGT [T7| . 
and may vary with the fitness of the donor and recip¬ 
ient [471 . 1 5 511 a nd other factors such as environmental 
conditions |56l |. To focus on qualitative mechanisms, we 
here consider the simplest setting where c is just a non- 
negative constant. We note that, via the factors kA, ks 
and the presence or absence of HGT-links {A, H, ( 3 ), the 
actual rate of all HGT events in the population still de¬ 
pends on how the population is distributed in genotype 
space. 


III. QUANTIFYING STOGHASTIC 
SWITCHING 


To see how HGT influences the evolutionary dynam¬ 
ics we study how the collective model dynamics depends 
on the competence c. The population sizes ki(t) of 
progenotes of different genotypes i present in the pop¬ 
ulation fully describe the state of the system at time t. 
We introduce the population entropy 


S{t) 



i=Q 


log 


kjjt) 

N 


( 2 ) 


to quantify how broadly the population is distributed in 
genotype space. Populations consisting of only one geno¬ 
type have population entropy zero. If the population is 
uniformly spread out in genotype space, the population 
entropy takes its maximal value S'^ax = ^log(2). 

Direct simulations of the stochastic dynamics reveal 
that for large competences c, the collective dynamics con¬ 
verge to a state of high population entropy where the 
population is highly spread out in genotype space (Fig- 
ure[2K) ■ It may only transiently switch to a state localized 
in state space, i.e., with relatively little spread in genetic 
material. In this high entropy state the total HGT rate 
in the population is orders of magnitude higher than in 
a speciated state (see below). The population does not 
adapt to the underlying fitness landscape; in that sense, 
HGT is the main driving process in this large-c scenario. 
We identify this state of high population entropy with a 
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Figure 2. The population dynamics is dominated by a speciated state for low competence and a distributed 
state for high competence. Shown are example dynamics of the population entropy for competence c = 5 (a), c = 3 
(b) and c = 1 (c) in an example population of = 1000 progenotes with genome length I — 7. (a): For high competence 
c the population entropy almost always fluctuates around a high value for all initial conditions, (b): The dynamics switch 
stochastically to a low entropy state and stay there longer for lower values of c. (c): The low entropy state is rendered globally 
stable for low c so that the population entropy fluctuates slightly above zero for all initial conditions. In the low entropy state 
the population dynamics are driven by selection, in the high entropy state by HGT. Panels (i) show the entropy dynamics, (ii) 
the average fitness (/) of the population corresponding to the entropy dynamics and (iii) the corresponding HGT rates thgt 
that the population exhibits at time t. For low population entropies the fitness is high and HGT rate small and vice versa for 
high population entropies. The mutation probability was set to fj,ij — 0.0001. Into the resulting Fujiyama fitness landscape 
with fitness values between /min = 0.9 and /max = 1.1 we inserted m = 2000 HGT-links. 


pre-Darwinian collective state, as in this state no distinct 
species can be distinguished and HGT is the dominant 
force driving the evolutionary dynamics. 


In contrast, if the progenotes’ competence for HGT is 
low, we observe a population dynamics which converges 
toward a state of low population entropy (Figure [2J:). 
This confirms the observation that selection, mutation 
and drift will drive a population to adapt to a fitness 
landscape if the mutation rate is not too high ^ 441. 
The population is thus concentrated around the fittest 
genotype with only rare mutations and genetic drift caus¬ 
ing some spread of the population. As a consequence, 
large parts of the population exhibit the same or similar 
genotypes such that it is in a speciated state. 


While the system spends almost all time close to its 
speciated state for low competence, the dynamics switch 
stochastically between the speciated and the distributed 
state if the competence is not small enough. Figures [5^- 
c show that the higher the progenotes’ competence for 
HGT is, the longer the system stays in the distributed 
state. 


We conclude that both the speciated state and the 
distributed state are dynamically accessible metastable 
states (for high enough competence). The dynamics only 
switch from one of these states to the other due to rare 
events in the stochastic dynamics. This is supported by 
the fact that the dynamics switch between these states 
on much shorter time scales than the time they remain 
in them (see also Figure |3] below). 

How does the distributed state disappear for low com¬ 
petences? To answer this question we developed a 
method based on the population entropy defined in (®) to 
study the forces induced on the dynamics by reproduc¬ 
tion and HGT. The evolution dynamics are event-driven, 
and the population entropy S may only change at these 
event times. At each event there is a population entropy 
S- directly before the event and a population entropy 
5'+ directly after the event. The change of population 
entropy 

AS = S+- S- (3) 

induced by a single event will in general depend on the 
type of event (reproduction or HGT) and the actual dis- 
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Figure 3. The high entropy state is dynamically sta¬ 
ble also for vanishing mutation probabilities. Shown 
is the measured percentage of time a population stayed in 
the distributed state for a system with mutation probabili¬ 
ties Hij = 10“® (blue), = 10“"^, ^ij = 10“® (orange), 
^ij = 10“® (green) and fiij — 0 (gray). Qualitatively, the 
results are similar, only for higher mutation probabilities the 
critical transition occurs at a lower value Ccr- System param¬ 
eters were I — 7, N = 1000 and m = 3000 HGT-links were 
introduced into a Fujiyama fitness landscape with fitness val¬ 
ues between /min = 0.9 and /max = 1.1. Each datapoint was 
obtained in a simulation of length T = 10® with the initial 
condition 5'(0) = 5'max. 


tribution of the population over genotype space. If the 
population is in a state with population entropy S, one 
event will thus induce a mean change AS{S) averaged 
over all events occurring at population entropy S. The 
rate r{S) at which these events occur depends on the 
state of the system as well. Multiplying the mean change 
induced by the single events with the rate at which the 
events occur, we obtain the average rate of change 

^=r(5)-A5(5) (4) 

induced on the dynamics. The reproduction and HGT 
events in our model occur independently of each other, 
so that their contributions separate additively according 
to 


disappears in a saddle-node bifurcation and the popu¬ 
lation converges to a speciated state. Furthermore, our 
analysis reveals that HGT alone can drive a population 
into a distributed state, even in a total absence of muta¬ 
tions (Figure [3]). 




Figure 4. The distributed state disappears for low 
HGT competence. At a critical competence the dynam¬ 
ical fixed point at high population entropy is destroyed in a 
saddle-node bifurcation. Here we show the analysis of the 
system yielding the dynamics illustrated in Figure [5] Panels 
(a) and (b) show the rate of change of the population entropy 
due to reproduction and HGT obtained with the methods de¬ 
scribed in the Methods section. The colors indicate the rate 
of change for competence values c = 1 (blue), c = 3 (red) and 
c = 5 (orange). Adding the results from (a) and (b) accord¬ 
ing to equation yields the overall rate of change S for the 
dynamics shown in (c). The arrow indicates the fixed point 
at high population entropies emerging through a saddle-node 
bifurcation for increasing the parameter c. With equation © 
we define a potential V (S) for the dynamics which is shown 
in (d) for the competences c = 1 (blue), c = 3 (red) and c = 5 
(orange) and additionally for c = 0.5 (gray), c = 2 (green) 
and c = 4 (black). The potential valley at high population 
entropies emerges between c = 1 and c = 2 so that the critical 
competence must lie between these two values. Each dataset 
was obtained in simulations measuring the dynamics for a 
time T = lO’’. 


dS _ d^Repr d'S’nGT /gx 

dt dt dt 

= mepr(<S') ■ A5'Repr(S') + ?'hGt(5') • AS'hGT (S') (6) 


Why do the dynamics almost always remain in the high 
entropy state for high competence? Using the average 
rate of change S{S) we define a potential 


We measured these functional dependencies in simula¬ 
tions of the dynamics (for more details see Supplemen¬ 
tary Information), thereby obtaining the forces induced 
by reproduction and HGT which drive the population en¬ 
tropy dynamics (Figure©. The bistability of the dynam¬ 
ics emerges because the impact of HGT increases with the 
diversity of the population. Thus, if the population en¬ 
tropy is high, HGT will drive it toward even higher pop¬ 
ulation entropy, and hence toward the distributed state. 
However, if the competence drops below a critical value, 
the impact of HGT on the population’s dynamics is al¬ 
ways smaller than that of selection, independent of the 
diversity of the population. Thus, the distributed state 


V{S) = - [ S{S')dS' 

Jo 


(7) 


in which the dynamics move under additional stochastic 
forcing. This potential is shown in Figure ©1. Accord¬ 
ing to reaction rate theory the depths of the two 
stable states’ potential wells determine the average time 
the dynamics stay close to each of the stable states. As 
the potential well at the distributed state becomes ever 
deeper for higher competence the dynamics hence stay 
ever longer in this state. 

Thus, our results suggest that when progenotes had 
high competence for HGT in early evolution, a dis- 
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tributed state was dynamically stable. If competence 
then decreased below a critical value, the distributed 
state may have disappeared and triggered the emergence 
of the first species. For this Darwinian transition to oc¬ 
cur, the population’s competence must have decreased 
dynamically in the distributed state. How could this have 
happened? 


C5 ® 

’'I 



Time 


Figure 5. A possible scenario for the evolution of dis¬ 
tinct species from a pre-Darwinian distributed state. 

The three time series sketched here are not simulation data, 
but encapsulate the speculations in the text, showing how the 
average competence, the average fitness, and the population 
entropy may evolve in the transition from a distributed state 
to the first distinct species. In the initial state (marked in 
blue) the competence is high, so that HGT drives the dynam¬ 
ics; the population exhibits a high population entropy and 
low average fitness. Through a stochastic switching the dy¬ 
namics reaches a state of low population entropy where the 
fitness is higher as the population adapts to the fitness land¬ 
scape. Here the population could evolve slowly toward lower 
competence. Thus, the dynamics switch back and forth be¬ 
tween the low and the high entropy state remaining longer 
and longer in the low entropy state as the competence de¬ 
creases (marked in red). When the competence goes below a 
critical value (marked by the dashed line in the top panel) the 
high entropy state disappears (marked in green), the dynam¬ 
ics remains in the low entropy state, the population’s average 
fitness increases and the first species may robustly evolve. 


A decrease in competence may rely on a mechanism 
that combines the stochastic switching uncovered above 
with the suggestion that fitter populations may tend to 
be less prone to HGT events, as schematically illustrated 
in Figure [51 As the speciated state is always stable, even 
if the population’s competence is high, the population 
dynamics will stochastically switch to this state repeat¬ 
edly for relatively short times. In the selection-dominated 
state (i.e., at low S) the population’s fitness increases. 
A fitter population that might be less prone to HGT 
events, as suggested recently [^, has a decreased over¬ 
all competence (lower c in our simplified model setting). 
Smaller c in turn increases the stochastic residence times 
the population spends in the selection-dominated state. 
This combination of two mutually amplifying contribu¬ 


tions (decreasing HGT rate and increasing fitness in the 
population) may yield decreasing competence in the long 
term such that after sufficiently many switches to the 
low-S” state, the competence may drop below a critical 
value where the distributed state disappears. The popu¬ 
lation then stays localized around the fittest genotypes, 
thus marking the time of transition to Darwinian evolu¬ 
tion. At this time, the first species can robustly emerge. 
The scenario shown schematically in Figure |5| illustrates 
one potential course of such repeated switching dynam¬ 
ics, with temporarily increased phases of higher fitness 
and decreasing HGT competence on long time scales. 


IV. CONCLUSION 


Our results provide a first glimpse of the possible dy¬ 
namics that may have led to the emergence of the first 
species from a distributed state dominated by HGT. We 
demonstrated that a high competence for HGT in a pop¬ 
ulation may suffice to drive the population into a dis¬ 
tributed state (Figure [2]). In this state HGT dominates 
the dynamics, in the sense that it inhibits the popula¬ 
tion’s ability to adapt to the underlying fitness land¬ 
scape and thus prevents it from crystallizing into dis¬ 
tinct species. Our analysis revealed that, independently 
of the mutation rate exhibited by the population, HGT 
can drive the population dynamics into a state where 
the population is widely spread out in genotype space 
(Figure [3]). We identify this state with a pre-Darwinian 
collective state envisioned by Woese [H-Iiil. 

Similarly, a state where no distinct species exist can 
emerge if the mutation rate in the population is too 
large (il . [s^ [^ . Above a critical mutation rate (the 
error threshold) the population cannot adapt to the un¬ 
derlying fitness landsca pe a nd will always evolve toward 
a quasispecies state 3 58, 3| similar to the distributed 


state induced by HGT shown above. However, there is a 
fundamental difference in the dynamics induced by HGT 
and that induced by mutations: While a mutation rate 
above an error threshold will always lead to a quasis¬ 
pecies state [3|i high rates of HGT as studied above in¬ 
duce a bistability of the dynamics where the distributed 
state coexists with a localized “speciated” state of low S. 
This coexistence may be essential for the evolution to¬ 
ward lower competence in a population and thus for the 
emergence of the first species; the coexistence is what 
enables a population originally in a distributed state to 
repeatedly switch to a low-S” state. As selection plays a 
major role in such a low-S” state, progenotes with lower 
competence would be selected for. Thus, with time, the 
entire population would evolve toward lower competence 
until the distributed state disappears as selection effects 
dominate the dynamics and the first species emerge. 

For the breakdown of the distributed state it is essen¬ 
tial that the population evolves toward a lower compe- 












tence. That the latter may in principle be possible was 
already suggested by Vogan and Higgs [^. Our results 
on an idealized model now demonstrate how stochastic 
switching and fitness-dependent competence may com¬ 
bine to create a transition from a bistable state to a 
speciated-only state. They in particular also suggest that 
HOT may be present at similar competence levels be¬ 
fore and after the emergence of the first species. From a 
complementary perspective, whereas one or a few species 
may already have existed, other population parts may 
still have been mixed without any clear species. So the 
very first species may only have marked the beginning 
of the decline of genuinely non-specific life, with other 
Darwinian transitions to follow. 
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